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IN THE CLAIMS : 

Applicants, pursuant to 37 C.F.R. § 1.121, submit the following amendments to the claims: 

1. (Currently amended) A method for selecting epigenetic features, comprising the 
steps of: 

a) collecting and storing biological samples containing mammalian genomic DNA; 

b) collecting and storing available phenotypic information about the biological samples so 
as to define a phenotypic data set; 

c) defining at least one phenotypic parameter of interest; 

d) dividing the biological samples into at least two disjunct phenotypic classes of interest 
using the defined phenotypic parameters of interest; 

e) selecting pairs of classes or pairs of unions of classes from the disjunct phenotypic 
classes of interest; 

f) defining, for each selected pair, an initial set of epigenetic features of interest; 

g) analyzing the defined epigenetic features of interest of the biological samples so as to 
generate an epigenetic feature data set for each pair; 

h) selecting relevant epigenetic features of interest and/or combinations of epigenetic 
features of interest of the defined epigenetic features of interest, the relevant epigenetic features of 
interest and/or combinations of epigenetic features of interest being relevant for epigenetically- 
based prediction of each pair of classes or pair of[[or]] unions the at least two phenotypic classes 
of interest; 

i) defining a new set of epigenetic features of interest based on the relevant epigenetic 
features of interest and/or combinations of epigenetic features of interest generated in step g); and 

j)[[J)]] performing epigenetically-based prediction of each pair of classes or pair of unions 
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of classes using a machine learning classifier. 

2. (Previously presented) The method as recited in claim 1, further comprising 
repeating steps g) and h) based on the new set of epigenetic features of interest defined in step i). 

3. (Previously presented) The method as recited in claim 1, wherein the biological 
samples include at least one of: cells; cellular components which contain DNA; sources of DNA; 
tissue embedded in paraffin; and histological object slides. 

4. (Previously presented) The method as recited in claim 3, wherein the sources of 
DNA include at least one of: cell lines; biopsies; blood; sputum; stool; urine; and cerebral- spinal 
fluid. 

5. (Previously presented) The method as recited in claim 3, wherein the tissue 
embedded in paraffin includes at least one of tissue from eyes, intestine, kidney, brain, heart, 
prostate, lung, breast and liver. 

6. (Previously presented) The method as recited in claim 1, wherein at least one of the 
phenotypic information and the phenotypic parameter of interest are selected from the group 
consisting of: kind of tissue; drug resistance; toxicology; organ type; age; life style; disease 
history; signaling chains; protein synthesis; behavior; drug abuse; patient history; cellular 
parameters; treatment history; gene expression; and combinations thereof. 

7. (Previously presented) The method as recited in claim 1, wherein the epigenetic 
features of interest include cytosine methylation sites in DNA. 

8. (Previously presented) The method as recited in claim 1, wherein the initial set of 
epigenetic features of interest is defined using preliminary knowledge data about their correlation 
with phenotypic parameters. 

9. (Previously presented) The method as recited in claim 1, wherein the relevant 
epigenetic feature or a combination of epigenetic features is relevant for epigenetically-based 
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prediction of said phenotypic classes of interest when at least one of an accuracy and a 
significance of the epigenetically-based prediction of the phenotypic classes of interest is likely to 
decrease by exclusion of the corresponding epigenetic feature data of the epigenetic feature data 
set. 

10. (Currently amended) The method as recited in claim 1, wherein step e)[[d]]) is 
performed so as to select pairs of unions of classes from the at least divide the biological samples 
in two disjunct phenotypic classes of interest. 

11. (Currently amended) The method as recited in claim 10, further comprising 
performing , in j), the epigenetically-based prediction of the pairs of unions of at least two 
phenotypic classes of interest using a machine learning classifier. 

12. (Cancelled) 

13. (Currently amended) The method as recited in claim 1, wherein the selecting of 
step h)[[g)]] includes: 

defining a candidate set of and/or combinations of epigenetic features of interest of the 
defined epigenetic features of interest; 

defining a feature selection criterion; 

ranking the candidate set of epigenetic features of interest and/or combinations of 
epigenetic features of interest according to the feature selection criterion; and 

selecting the highest ranking epigenetic features of interest and/or combinations of 
epigenetic features of interest. 

14. (Previously presented) The method as recited in claim 13, wherein the candidate set 
of epigenetic features of interest is the set of all subsets of the defined epigenetic features of 
interest. 

15. (Previously presented) The method as recited in claim 13, wherein the candidate set 
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of epigenetic features of interest is a set of all subsets of a given cardinality of the defined 
epigenetic features of interest. 

16. (Previously presented) The method as recited in claim 13, wherein the candidate set 
of epigenetic features of interest is a set of all subsets of cardinality 1 of the defined epigenetic 
features of interest. 

17. (Original) The method as recited in claim 13, wherein the defining the candidate set 
of epigenetic features of interest and/or combinations of epigenetic features of interest is 
performed by subjecting the epigenetic feature data set to principal component analysis, principal 
components of the principal component analysis defining the candidate set of epigenetic features 
of interest and/or combinations of epigenetic features of interest. 

18. (Withdrawn) The method as recited in claim 13 wherein the defining the candidate 
set of epigenetic features of interest and/or combinations of epigenetic features of interest is 
performed by subjecting the epigenetic feature data set to multidimensional scaling, calculated 
coordinate vectors of the multidimensional scaling defining the candidate set of epigenetic features 
of interest and/or combinations of epigenetic features of interest. 

19. (Withdrawn) The method as recited in claim 13 wherein the defining the candidate 
set of epigenetic features of interest and/or combinations of epigenetic features of interest is 
performed by subjecting the epigenetic feature data set to isometric feature mapping, calculated 
coordinate vectors of the isometric feature mapping defining the candidate set of epigenetic 
features of interest and/or combinations of epigenetic features of interest. 

20. (Withdrawn) The method as recited in claim 13 wherein the defining the candidate 
set of epigenetic features of interest and/or combinations of epigenetic features of interest is 
performed by subjecting the epigenetic feature data set to cluster analysis and then combining 
epigenetic features of interest belonging to a same cluster so to define said candidate set of 
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epigenetic features of interest and/or combinations of epigenetic features of interest. 

21. (Withdrawn) The method as recited in claim 20 wherein the cluster analysis 
includes hierarchical clustering. 

22. (Withdrawn) The method as recited in claim 20 wherein the cluster analysis 
includes k-means clustering. 

23. (Withdrawn) The method as recited in claim 13 wherein the defining the candidate 
set of epigenetic features of interest and/or combinations of epigenetic features of interest is 
performed using predetermined biological information. 

24. (Withdrawn) The method as recited in claim 23 wherein the biological information 
includes at least one biological factor selected from the group consisting of: correlated methylation 
status, proximity of epigenetic features to each other on a genome, epigenetic features located on a 
same gene, epigenetic features that are a exon/intron/promoter of a same gene, epigenetic features 
located on genes that are co-regulated, epigenetic features located on genes that have similar 
biological functionality, and epigenetic features located on genes that are part of the same 
biological pathway. 

25. (Previously presented) The method as recited in claim 13, wherein the feature 
selection criterion includes a training error of the machine learning classifier trained on respective 
epigenetic feature data of the epigenetic feature data set corresponding to the candidate set of 
epigenetic features of interest and/or combinations of epigenetic features of interest. 

26. (Withdrawn) The method as recited in claim 13 wherein the feature selection 
criterion includes a risk of the machine learning classifier trained on epigenetic feature data 
corresponding to the candidate set of epigenetic features of interest and/or combinations of 
epigenetic features of interest. 

27. (Withdrawn) The method as recited in claim 13 wherein the feature selection 

DWT 1 12973 15vl 0047675-000155 6 
Seattle 



Application No.: 10/672,515 
Attorney Docket No.: 47675-155 
First Applicant's Name: Peter Adorjan 
Application Filing Date: September 25, 2003 
Office Action Dated: December 13, 2007 
Date of Response: June 13, 2008 
Examiner: Russell Scott Negin 

criterion includes a bound on a risk of the machine learning classifier trained on epigenetic feature 
data corresponding to the candidate set of epigenetic features of interest and/or combinations of 
epigenetic features of interest. 

28. (Withdrawn) The method as recited in claim 13 wherein the feature selection 
criterion includes a statistical test for computing a significance of difference of the phenotypic 
classes of interest given epigenetic feature data corresponding to the candidate set of epigenetic 
features of interest and/or combinations of epigenetic features of interest. 

29. (Withdrawn) The method as recited in claim 28 wherein the statistical test includes 

a t-test. 

30. (Withdrawn) The method as recited in claim 28 wherein the statistical test includes 
a rank test. 

31. (Withdrawn) The method as recited in claim 30 wherein the rank test includes a 
Wilcoxon rank test. 

32. (Withdrawn) The method as recited in claim 28 wherein the statistical test includes 
a multivariate test. 

33. (Withdrawn) The method as recited in claim 32 wherein the multivariate test 
includes a T 2 -test. 

34. (Withdrawn) The method as recited in claim 32 wherein the multivariate test 
includes a likelihood ratio test for logistic regression models. 

35. (Withdrawn) The method as recited in claim 13 wherein the feature selection 
criterion includes a Fisher criterion for the phenotypic classes of interest given the epigenetic 
feature data corresponding to the candidate set of epigenetic features of interest and/or 
combinations of epigenetic features of interest. 

36. (Withdrawn) The method as recited in claim 13 wherein the feature selection 
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criterion includes weights of a linear discriminant for the phenotypic classes of interest given 
epigenetic feature data corresponding to the candidate set of epigenetic features of interest and/or 
combinations of epigenetic features of interest. 

37. (Withdrawn) The method as recited in claim 36 wherein the linear discriminant is a 
Fisher discriminant. 

38. (Withdrawn) The method as recited in claim 36 wherein the linear discriminant is a 
discriminant of a support vector machine classifier for the phenotypic classes of interest trained on 
epigenetic feature data corresponding to the candidate set of epigenetic features of interest and/or 
combinations of epigenetic features of interest. 

39. (Withdrawn) The method as recited in claim 13 wherein the defining the epigenetic 
feature selection criterion includes subjecting epigenetic feature data corresponding to the 
candidate set of epigenetic features of interest and/or combinations of epigenetic features of 
interest to principal component analysis and calculating weights of a first principal component. 

40. (Withdrawn) The method as recited in claim 13 wherein the epigenetic feature 
selection criterion includes an average pairwise correlation between all single features in a given 
subset of epigenetic features on a given set of samples. 

41. (Withdrawn) The method as recited in claim 13 wherein the epigenetic feature 
selection criterion includes mutual information between the phenotypic classes of interest and a 
classification achieved by an optimally selected threshold on s given epigenetic feature of interest. 

42. (Withdrawn) The method as recited in claim 13 wherein the epigenetic feature 
selection criterion includes a number of correct classifications achieved by an optimally selected 
threshold on a given epigenetic feature of interest. 

43. (Withdrawn) The method as recited in claim 13 wherein the epigenetic feature 
selection criterion includes eigenvalues of the principal components. 
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44. (Previously presented) The method as recited in claim 13, wherein the selecting the 
highest ranking epigenetic features of interest and/or combinations of epigenetic features of 
interest is performed by selecting a defined number of highest ranking epigenetic features of 
interest and/or combinations of epigenetic features of interest. 

45. (Withdrawn) The method as recited in claim 13 wherein the selecting the highest 
ranking epigenetic features of interest and/or combinations of epigenetic features of interest is 
performed by selecting all except a defined number of lowest ranking epigenetic features of 
interest and/or combinations of epigenetic features of interest. 

46. (Withdrawn) The method as recited in claim 13 wherein the selecting the highest 
ranking epigenetic features of interest and/or combinations of epigenetic features of interest is 
performed by selecting epigenetic features of interest and/or combinations of epigenetic features 
of interest with a feature selection criterion score greater than a defined threshold. 

47. (Withdrawn) The method as recited in claim 13 wherein the selecting the highest 
ranking epigenetic features of interest and/or combinations of epigenetic features of interest is 
performed by selecting epigenetic features of interest and/or combinations of epigenetic features 
of interest with a feature selection criterion score lower than a defined threshold. 

48. (Previously presented) The method as recited in claim 2, wherein repeating steps g) 
and h) is performed until a defined number of the epigenetic features of interest and/or 
combinations of epigenetic features of interest are selected. 

49. (Previously presented) The method as recited in claim 2, wherein repeating steps g) 
and h) is performed until all epigenetic features of interest and/or combinations of epigenetic 
features of interest of the epigenetic features of interest and/or combinations of epigenetic features 
of interest with a feature selection criterion score greater than a defined threshold are selected. 

50. (Previously presented) The method as recited in claim 2, further comprising 
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determining an optimal number of epigenetic features of interest and/or combinations of 
epigenetic features of interest using cross validation of a machine learning classifier on test subsets 
of epigenetic feature data. 

51. (Previously presented) The method as recited in claim 13, further comprising 
determining an optimal feature selection criterion score threshold by crossvalidation of the 
classifier on test subsets of epigenetic feature data. 

52. (Previously presented) The method as recited in claim 1, further comprising 
training a machine learning classifier using a feature data set corresponding to the defined new set 
of epigenetic features of interest. 

53. -97. (Cancelled) 
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